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Abstract 

The HSP70 family of molecular chaperones function to maintain protein quality control and homeostasis. The major stress- 
induced form, HSP70 (also called HSP72 or HSPAIA) is considered an important anti-cancer drug target because it is 
constitutively overexpressed in a number of human cancers and promotes cancer cell survival. All HSP70 family members 
contain two functional domains: an N-terminal nucleotide binding domain (NBD) and a C-terminal protein substrate- 
binding domain (SBD); the latter is subdivided into SBDa and SBDP subdomains. The NBD and SBD structures of the 
bacterial ortholog, DnaK, have been characterized, but only the isolated NBD and SBDa segments of eukaryotic HSP70 
proteins have been determined. Here we report the crystal structure of the substrate-bound human HSP70-SBD to 2 
angstrom resolution. The overall fold of this SBD is similar to the corresponding domain in the substrate-bound DnaK 
structures, confirming a similar overall architecture of the orthologous bacterial and human HSP70 proteins. However, 
conformational differences are observed in the peptide-HSP70-SBD complex, particularly in the loop p that bridges SBDot 
to SBDp, and the loop Ll,i that connects the SBD and NBD. The interaction between the SBDa and SBOP subdomains and 
the mode of substrate recognition is also different between DnaK and HSP70. This suggests that differences may exist in 
how different HSP70 proteins recognize their respective substrates. The high-resolution structure of the substrate-bound- 
HSP70-SBD complex provides a molecular platform for the rational design of small molecule compounds that preferentially 
target this C-terminal domain, in order to modulate human HSP70 function. 
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Introduction 

The HSP70 family proteins represent an evoIutionarUy 
conserved group of molecular chaperones that are important for 
maintaining protein quality control and protein homeostasis. They 
were first identified more than thirty years ago in DrosophUa as 70 
kD proteins that were induced by heat stress or other potentially 
lethal stimuli [1,2], and found to be critical for maintaining cell 
survival [3,4]. Subsequendy, other HSP70 family proteins were 
identified in both prokaryotes and eukaryotes, with many 
members also shown to be constitutively expressed and exhibit 
important housekeeping functions [5-8]. Among the many 
activities of HSP70 family proteins are the chaperoning of nascent 
polypeptides and unfolding of misfolded protein substrates, the 
facilitation of protein transport to organelles, the protection and/ 
or dissolution of multi-protein complexes, and the targeting of 
some misfolded proteins for degradation [5,6,9-12]. HSP70 
proteins are considered among the most conserved proteins in 



evolution as they are found in all kingdoms from archaebacteria to 
humans. While most prokaryotes have only one HSP70 gene, 
some gram-negative bacteria and all eukaryotes encode several 
HSP70 proteins. For example, Escherichia coli has three HSP70 
proteins: DnaK, HscA(Hsc66) and HscC(Hsc62), Saccharomyces 
cerevesiae encodes at least ten family members, and Homo sapiens 
encodes at least eight paralogs [5-8,13,14]. Some family members 
are thought to serve tissue-specific or organelle-limited roles; some 
are constitutively expressed, and still others are stress-induced. 
Current evidence suggests that certain HSP70 family members 
may serve overlapping, or specific, functions in a cell or organism 
[6,15,16]. Thus, a better understanding of the functional diversity 
of the HSP70 family proteins would benefit from greater insight 
regarding their structure-activity relationships. 

The stress inducible human protein HSP70 (also called 
HSPAIA/AIB, HSP70-1 and HSP72) is of particular interest 
because it is considered a cancer-critical survival protein [17,18]. 
Unlike the closely related, but constitutively expressed HSC70 



PLOS ONE I www.plosone.org 



1 



July 2014 I Volunne 9 | Issue 7 | e103518 



Structure of NRLLLTG-Bound HSP70 



(also known as HSPA8, Hsp70-8 and HSP73) protein, HSP70 is 
not essential for viability, as knockout mice for HSP70 are viable 
and fertile [19]. Additionally, unlike HSC70, HSP70 is expressed 
at very low levels in unstressed normal cells but is rapidly up- 
regulated under a variety of stress conditions. Importantly, it is 
constitutively overexpressed in most human cancer cells, and its 
elevated expression correlates with resistance to therapy and poor 
prognosis [6,13,15,16]. Evidence indicates that, among its cancer- 
supporting activities, HSP70 protects cells from apoptosis and the 
prc)t(;c)toxic stress associated with oncoproteins and aberrant 
m("tab<)lism [20-22]. Aci:or(lingly. this molecular chaperone has 
emerged as an attractive therapeutic target, and several groups 
have focused efforts on the identification of HSP70 inhibitors for 
use in cancer therapy. To date, however, relatively few effective, 
well-characterized modulators of HSP70 activities have been 
reported [7,8,22-30]. In general, a lack of structural information 
on the human HSP70 protein has slowed the development of more 
effective, clinically useful inhibitors. It is expected that generating 
new structural information on human HSP70 should aid in the 
development of such modulators. 

HSP70 binds to small hydrophobic stretches of amino acids in 
nascent or partially folded substrates; together with the actions of 
critical co-chaperones, it directs the substrates to a particular fate, 
such as re-folding or degradation. HSP70 proteins share a similar 
overall structure, comprised of an N-terminal nucleotide binding 
domain (NBD) that, on its own, exhibits modest ATPase activity, 
and a C-terminal peptide substrate-binding domain (SBD). The 
actions of the two major domains of the HSP70 proteins are 
allosterically regulated. In the presence of ADP, model substrate 
peptides such as NRLLLTG or client proteins interact with high 
affinity. However, when ATP is bound to the NBD, substrate 
binds significantly more weakly. The NBD is subdivided into four 
subdomains, which are partitioned into two lobes (I and II) by a 
central ATP/ADP binding pocket [31,32]. The SBD is composed 
of a two-layered fi-sandwich (SBDP), which contains the peptide 
binding pocket, and an a-helical subdomain (SBD a) that functions 
as a Ud, covering the substrate binding cleft [33,34]. The SBDP 
and SBDa subdomains are connected by the loop L(x,|3> which 
shows limited species conservation. The NBD and SBD are 
bridged by a highly conserved interdomain linker (also known as 
loop Ll,i), which has been imphcated as being critically involved 
in modulating the allosteric regulation of HSP70 proteins [35-39]. 
The extreme C-terminal domain of HSP70 is believed to be 
largely unstructured, and is the docking site for some co- 
chaperones [6]. 

Among the HSP70 family proteins, E. coli DnaK has been the 
most extensively studied member at the structural level, though 
there is some structural information about the mammalian heat 
shock cognate (HSC70) member as well. Indeed, much of our 
mechanistic understanding of HSP70 structure and activity has 
come from analyses of E. coli DnaK. Several structures of the 
DnaK NBD [40] and SBD region, either alone or in complex with 
various client peptides [33,34,41-44], have been determined. 
Several near full-length DnaK structures, bound to either ADP or 
ATP in the presence or absence of substrates, were also more 
recently determined [37,39,45,46]. In the past two decades, 
several structures of isolated NBDs of HSC70 in the absence of 
nucleotides and in the presence of ADP or ATP as well as the 
isolated SBDa subdomain of HSC70 have been determined 
[45,47-51]. Finally, a near full-length, nucleotide-free bovine 
HSC70 X-ray structure (residues 1-54; PDB code lYUW) has 
been reported [52]. 

Significandy less is known about the structure of the stress- 
inducible human HSP70, particularly of the SBD. To date, the 



following structures of human HSP70 have been determined: the 
nucleotide-bound or apo-form NBD [53-58], the isolated NBD in 
complex with the inhibitor VER- 155008 (PDB code 4I08) [59], 
and the isolated C-terminal SBDot (residues 537-610 [60]; or 
residues 534—615, PDB code 3LOF). No structure is available of a 
complete HSP70 SBD, which includes both SBDP and SBDa, and 
no structure has been reported of the HSP70 SBD in complex with 
a substrate. Here we present the 2 A crystal structure of the stress 
inducible human HSP70 (residues 386-616) bound to a hepta- 
peptide model substrate, NRLLLTG, used in previous DnaK 
protein structural analyses. This high-rc-solution structure repre- 
sents the first of an intact eukaryotic HSP70 SBD; it provides a 
previously unavailable molecular platform to use in structure- 
function analyses and should aid in the rational design of HSP70- 
specific modulators that preferentially target the SBD of this 
important molecular chaperone. 

Materials and Methods 

Protein Expression and Purification 

Constructs of human HSP70 (residues 1-641, 386-616, 391- 
615, and 395-510) were cloned into pET25 (EMD Millipore 
Chemicals, Inc., Billerica, MA, USA), between the Ndel and Xhol 
restriction sites, for expression as N-terminal Hisg-tagged fusion 
proteins in E. coli. All plasmid inserts were verified by DNA 
sequencing. The E. coli BL21 Star (DE3) competent cells 
(Invitrogen catalog number C60 10-03) were transformed with 
the respective expression plasmids. The resulting strains were 
grown at 37°C in LB medium containing 50 |J,g/ml of carbeni- 
cillin (Sigma-Aldrich Co., St. Louis, MO, USA). At an ODgo,, 
~0. 3-0.5, protein expression was induced with 0.5 mM IPTG, 
and cells were subsequently grown overnight at 25°C. Cells were 
collected by centrifugation and resuspended in the Bugbuster 
Master Mix (EMD Millipore Chemicals catalog number 71456-4), 
supplemented with protease inhibitors, 1 mM DTT and 20 mM 
imidazole. The Hisg-tagged fusion proteins were isolated on Ni^*- 
chelating rc-sins (Ni-NTA Superflow, Qiagen catalog number 
30410) by standard procedures. The purified proteins were 
dialyzed thoroughly against 10 mM Tris-HCl (pH 7) supplement- 
ed with 5 mM DTT at 4°C. When necessary, size-exclusion 
chromatography on a Superdex-200 or a Superose 6 10/300 GL 
(GE Healthcare) analytical column pre-equilibrated with 10 mM 
Tris-HCl (pH 7) and 5 mM DTT was used as a final purification 
step. The pure proteins were concentrated and microcentrifugated 
at 1 5,000 xg for 5 min at 4°C, the concentration was determined 
by Bradford Protein Assay and aliquots of the soluble proteins 
were stored at — 80°C. 

Crystallization, Data Collection, and Structure 

Determination 

We used the hanging drop vapor diffusion technique to co- 
crystaUize the HSP70-SBD with the NRLLLTG peptide. 5 mM of 
NRLLLTG (Biomatik, Wilmington, Delaware, USA) was mixed 
with 400 IXM of HSP70 protein (residues 386-616, 391-615, or 
395-510) or 400 |XM of Seleno-Methionine (SeMet)-derivatized 
HSP70 protein. The protein mixtures were preheated to 42°C for 
15 minute's and gradualh' cooled to room temperature;. The 
protein preparations were subsequently microcentrifugated at 
15,000 Xg for 5 min at room temperature, and the soluble protein 
complexes were used to set up crystallization screens. The best 
crystals were obtained using a HSP70 construct containing 
residues 386-616 from a crystallization solution of 0.1 M Bis-Tris 
(pH 5.5), 0.2 M Li2S04, witii 28-30% PEG3350 in the reservoir 
and 22%~25% PEG3350 in the crystaUization drop. The crystals 
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grow to full size (0.05~0.2 mm in three dimensions) in 2~3 days 
and flash-frozen directly in liquid nitrogen. Data were collected 
using the X29A beamline at the National Synchrotron Light 
Source (Brookhaven National Laboratory'). One data set each for 
native and SeMet-derivatized HSP70-SBD/substrate crystals were 
collected at wavelengths of 1.075 A and 0.9791 A (selenium peak), 
respectively, and then processed with HKL2000 [61]. The data 
were useful to resolution limits of 2 A and 2.5 A, respectively. 
Eight selenium sites from data collected from the SeMet- 
derivatized crystals were identified with the HYSS program [62] 
using the Phenix suite [63]. These sites were input into the 
Phenix/Phaser program [64] and protein phases were determined 
using the same dataset by the single-wavelength anomalous 
dispersion (SAD) method. A high quality electron density map 
was generated and a initial model of 323 amino acid within 16 
fragments were build using Phenix/ Autobuild [65] automatically. 
The structure was refined and manually adjusted using the high 
resolution native dataset l)y iterative cycles of refinement with 
Phenix/Refine [66] and model building with Coot [67] (Table 1). 
AH the structural alignments and figures were prepared with 
Pymol (http:/ /www.pymol.org). 

Protein Data Bank accession code 

Atomic coordinates for the human HSP70 SBD structure have 
been deposited with the Protein Data Bank under the identifier 
code 4po2. 

Results and Discussion 

Overall Structure of the Human NRLLLTG-Bound HSP70- 
SBD 

We set out to determine the high-resolution crystal structure of 
the human HSP70 SBD bound to a client peptide substrate. 
Towards that goal, we were able to prepare well-ordered crystals 
of HSP70 (aa 386-616) bound to a NRLLLTG peptide that 
formed in the C2 space group and diffracted to 2 A resolution. 
The structure of the complex was determined with single 
anomalous diffraction using selenomethionine-derivatized protein 
and refined to 2 A to R„ork and Rfree values of 17.54% and 
21.78%, respectively, with excellent stereochemistry (Fig. lA and 
Table 1). 

The X-ray crystal structure of NRLLLTG-bound HSP70-SBD 
reveals two molecules of the complex per asymmetric unit cell. 
Each molecule contains the characteristic SBDP and SBDa 
subdomains (Fig. lA), as previously observed in the NRLLLTG- 
bound DnaK-SBD [33]. Molecule A (MolA) can be traced from 
Asn387 to Gly613, while Molecule B (MolB) can only be traced 
from Asp395 to Gly613 and contains a gap in the electron density 
map corresponding to residues 506-509 in the loop Lj, p region. 
Based on sequence alignment, the human HSP70 SBD domain 
(residues 391-61,5) corresponds to residues 389-607 of DnaK, 
which was used pre\'iously for structural studies [33]. Since we 
employed residues 386-616 of human HSP70 for structure 
determination here, the human HSP70 SBD structure is five 
amino acids longer at the N-terminus than the DnaK SBD 
previously reported [33]. Additionally, we retained the 6-his tag at 
the N-terminus for cry-stallization. This N-termiiial rc'gion of the 
HSP70 SBD, which represents the interdomain linker between the 
NBD and SBD, forms a short helix on MolA (referred to here as 
aN) but is disordered in MolB. These differences are likely 
mediated by the crystal environment since the N-terminus of 
MolA forms crystal contacts while the corresponding region of 
MolB does not. The corresponding region of intact E.coli DnaK 
forms a (3 strand [39], and the corresponding region of ADP- 



bound intact Geobacillus kaustophilus HTA426 DnaK (gkDnaK) 
forms an extended loop structure [45]; it is also distinct from a 
loop in the nucleotide free NBD of the bovine HSC70 structure 
[52]. Taking this comparison together, suggests that the inter- 
domain linker between the NBD and SBD of human HSP70 is 
most likely to be inherently flexible. 

Comparison of the two molecules in the asymmetrical 
unit cell 

The overall fold of MolA and MolB in the asymmetric unit cell 
is essentially identical (Fig. IB). Each monomer is composed of the 
SBDP subdomain that contains the pep tide-binding pocket, and 
an SBDa subdomain, also known as the lid. A superposition of 
MolA to MolB reveals that the SBDP subdomain contains a root 
mean square deviation (RMSD) of 0.18 A for 110 equivalent Ca 
positions. The SBDa subdomain shows more variability, partic- 
ularly in the three-hehx bundle of the SBDa region, with an 
overall RMSD of 1.7 A for 198 equivalent Ca positions of the 
SBD. The latter observation is consistent with prior studies of 
substrate-bound DnaK structures, which indicated the existence of 
considerable structural variability within the SBDa region 
[33,41,42,69]. 

The linker region between the a and P subdomains (loop Lo, p) is 
of particular interest because it has been shown in the ATP-bound 
DnaK structures to serve as a dynamic hinge that serves to 
appropriately position the long helix aB of the SBDa[37,39]. A 
sequence alignment of this region reveals significant differences 
between E. colt DnaK and human HSP70s. In DnaK, this linker 
region is composed of residues with small side chains (ASSGL or 
SSSGL), while in human HSP70 this linker region is composed of 
conserved amino acids with larger side chains (NDKGRL) (Fig. 2). 
Phylogenetic analysis shows that, unlike other regions of HSP70, 
this region is significantly different between prokar\'otes and 
eukaryotes [70]. In MolA of our structure, this linker is involved in 
crystal packing through interactions with a loop from an adjacent 
molecule (Fig. IC). Specifically, the carboxylate oxygen atom of 
Gly508 hydrogen bonds with arginine and asparagine residues of a 
symmetr)'-related molecule. The side chain of Arg509 also folds 
back to interact with both the backbone of Thr504 and Asn505, 
and the side chain of Asp5()6 and Glu444. In addition, the side 
chain of Asp506 forms hydrogen bonds with Glu444. All these 
interactions hold the linker region in a fixed position, so that the 
electron density map in this region is very clear. In contrast, the 
corresponding linker region in MolB is not observed because the 
adjacent molecule is more than 5 A away and, as a consequence, 
the electron density is poorly defined in this region of MolB 
(Fig. ID). The dramatically different conformations of the linker 
region in the two molecules in the asymmetric unit is consistent 
with the inherent flexibility of this region in human HSP70. 

Comparison to other HSP70 SBD Structures 

The human HSP70 SBD is highly homologous to the DnaK 
SBD (51% sequence identity in the full-length protein and 47% 
identity in the SBD) (Fig. 2). Not surprisingly, the overall fold of 
the HSP70-NRLLLTG structure closely approximates the other 
substrate-bound DnaK SBD structures, with the SBDP subdomain 
showing the greatest d(;gT(X' of structural homology with an 
RMSD of about 0.4 A, depending on which DnaK structure is 
used for the comparison (Fig. 3A). In contrast, loop L^ p is 
significantiy different as there is one more residue inserted in this 
region of the human HSP70 proteins, and, as stated previously, the 
side chains are longer than those in DnaK. The SBDa subdomains 
of the peptide-bound HSP70 and DnaK structures show greater 
structural variability with an average RMSD of about 1.5 A 
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Table 1. Data Collection and Refinement Statistics. 







SeMet-HSP70-NRLLLTG 


Native HSP70-NRLLLTG 


Data collection 


Space group 


C2 


C2 


Cell dimensions 


a, b, c (A) 


100.25, 84.80, 72.15 


100.29, 85.28, 72.20 


a, P, Y n 


90.00, 1 27.20, 90.00 


90.00, 1 26.97, 90.00 


Wavelength 


0.97910 


1.0750 


Resolution (A) 


50-2.50 (2.59-2.50) 


50-2.0 (2.07-2.00) 


l^merge 


8.9 (31.9) 


9.9 (24.5) 


l/cr (1) 


20.8 (6.7) 


20.7 (8.1) 


Data completeness (%} 


100.0 (100.0) 


99.8 (99.4) 


Redundancy 


7.4 (7.5) 


7.5 (7.5) 


No. reflections 


16817 


32837 


Refinement 


Resolution (A) 




50-2.0 (2.07-2.00) 


No. reflections 




32790 


Rwork/Rfree 




17.54/21.78 


Molecules/a. u. 




2 


No. atoms 




4105 


Protein 




3448 


Peptide 




99 


Water 




543 


B-factor (A^) 


Average 




29.59 


Protein 




28.12 


Peptide 




30.89 


Water 




37.07 


r.m.s deviations 


Bond lengths (A) 




0.0040 


Bond angles {') 




0.842 


Ramachandran plot (%) 


Favored regions 




99.3 


Additionally allowed regions 




0.7 



r.m.s., root-mean-square deviation. Values in parenthesis are for the highest resolution shell. 
doi:l 0.1 371 /journal.pone.Ol 0351 8.t001 



(Fig. 3A). Importantly, the modest degree of structural variation 
between the SBD of HSP70 and DnaK is on par with the 
structural variation between the two HSP70 molecules in the 
asymmetric unit, which is consistent with the highly conserved 
structures of the SBD of both human HSP70 and bacterial DnaK. 
This observation is in contrast to previous observations based on 
partial HSC70 SBD structures, which suggested that the HSC70 
SBDa region harbors considerable diflFerences to the correspond- 
ing region of DnaK [49,52]. 

The SBDa subdomain is comprised of five ot-helices (ctA-aE). 
The helix otA connects the loop L^j p to the oiB helix, the C- 
terminal half of which forms an anti-paraUel three-helix bundle 
with helices aC and oiD. The otD helix is contiguous with the (xE 
helix. The structure of HSP70 reveals that the helices within the 
lid are stabilized by extensive van der Waals interactions between 
hydrophobic residues (Fig. 3B). These hydrophobic residues are 
not strictly conserved (including Leu.542, Ala546, Met549, Val553, 
Ile562, Val570, Cys574, Val577, Leu581, Phe592, Leu599, 



Cys603, Ile606, LeuBlO in HSP70), but the corresponding 
positions in DnaK are always occupied by hydrophobic residues 
(Figs. 2). The helix aE in the HSP70-NRLLLTG structure is 
kinked by about 30° because of the hydrophobic interaction 
between this helix and the three helical bundle. This is significandy 
different from helix aE of crystalized DnaK (which is 1 residue 
shorter at the C-terminus than the human HSP70 SBD structure 
reported here), which is kinked by about 60°. Specifically, our 
findings are most consistent with the following eukaryotic HSP70 
SBDa structures: the crystal structures of the human HSP70- 
SBDa helical bundle (residues 534-613; PDB code 3LOF; overall 
Ca RMSD = 0.42 A); the solution structure of human HSP70 
SBDa helical bundle (residues 537-610; PDB code 2LMG; overall 
Ca RMSD = 0.77A) [60] (Fig. 3C); and the C. elegans Hsp70 
SBDa helical bundle (residues 533-614; PDB code 2P32; overall 
Ca RMSD = 0.58 A) [71] (Fig. 3D). Based on the comparison of 
these structures, it appears that eukaryotic HSP70 proteins have a 
different helix aE conformation than DnaK. 
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Figure 1. Overall structure of the human peptide-bound HSP70 SBD. A. Cartoon representation of NRLLLTG-bound HSP70 SBD Molecule A. 
The SBD and NRLLLTG peptide are shown in magenta and blue, respectively, and all domains and secondary structural elements are labeled. B. 
Superposition of molecules A and B in the asymmetric unit cell. Molecules A and B are color-coded magenta and green, respectively. C and D. Hinge 
region between the a and p subdomains. Molecules A and B are shown in panels C and D, respectively. Interacting (MolA) or potential interacting 
(MolB) residues are indicated in stick and CPK coloring (oxygen in red and nitrogen in blue). The electron density maps corresponding to these 
residues are contoured at 1.5 (MolA) and 1.0 (MolB) sigma. Hydrogen bonds are indicated with dotted lines. 
doi:1 0.1 371 /journal.pone.01 0351 S.gOOl 



The previous crystal structure of the rat HSC70 lid region 
revealed a dramatically difTerent organization of the helices, in 
which helices otC, ocD and otE are combined to form a long helix 
[49] (Fig. 3E). Although this structure suggested that the SBDot of 
HSC70 is structurally distinct from that of DnaK, our structure 
shows that human HSP70 adopts the same SBDa fold as DnaK. 
The different helbc organization in HSC70 might be influenced by 
the truncated construct used or the crystal lattice environment. 
This conclusion is bolstered by the analogous structures of the 
three isolated three-helix bundle structures from eukaryotic 
HSP70 discussed above. Taken together, this comparison is 
consistent with a structurally conserved and rigid lid region 
between the eukaryotic and bacterial HSP70 proteins. 

The combined structural data support the notion that the SBDa 
subdomain is inherendy more flexible than the SBDP [33,42,69]. 
This is even true when one compares the SBDot subdomain from 
different molecules in the asymmetric unit of the same crystal 
structure, as illustrated in the comparison between MolA and 
MolB of HSP70 reported here (Fig. IB). On closer inspection, it 
appears that the junction for this variability occurs around the 
center of helix otB where this helix participates in the helical 
bundle of the SBDa subdomain (Fig. 3F). In particular, consistent 
with the DnaK juncture region (536-538) [33], the juncture point 
for this region appears to be around residue Asn540. This residue 
is identical in most HSP70 proteins (except E. coli HscA) and also 
makes conserved interactions with residue Ala406 on the SBD (3 



subdomain. In this HSP70 structure, Asn540 interacts with the 
backbone of Ala406 (3.3 A in MolA, 3.0 A in MolB), which is in 
the Li 2 loop forming the substrate-binchng groove on the SBDP 
subdomain. A helical turn away, Val536, which is either a valine 
or isoleucine residue in HSP70 proteins, makes van der Waals 
contacts with Gly407 and Val409; notably, Gly407 is stricdy 
conserved in all HSP70s and Val409 is conserved on strand |32 of 
the SBDf5 subdomain in most HSP70 proteins, except for E. coli 
HscA (Fig. 2). Val536 is always placed in between Gly407 and 
Val409. Additional hydrogen bond interactions occur between 
Asp529 at the N-terminal end of helix aB and Arg447 of loop L4 5 
of the SBDP subdomain. Together, these residues are highly 
conserved in HSP70 proteins. These conserved interactions fix the 
conformation of the N-terminal half of helix aB. However, the 
interactions in the C-terminal part of helix aB are less conserved. 
For instance, in MolA, residues Glu543 and Ser544, one helical 
turn C-terminal to Asn540 of hehx aB, hydrogen bond with 
Arg469 of loop Lj^g and Tyr431 of loop L3_4 of the SBDP 
subdomain (Fig. 3F). In contrast, these interactions differ in MolB, 
with Arg469 adopting a different conformation that cannot 
contact Glu543 and Ser544. Since Arg469 extends to Phe547, it 
appears that MolB is forming a destabilizing interaction in this 
region. Together, this network of interactions between the N- 
terminal end of helix aB of the SBDa subdomain and the loops of 
the SBDP subdomain serve to rigidly link this region of helix aB as 
well as the preceding helix oA to the SBD fi subdomain, but leaves 
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Figure 2. Sequence alignment of eul<aryotic and bacterial HSP70 proteins. Human HSP70 (HSPAIA or HSP72), HSPA2, HSPA5 (GRP78 or 
BiP), HSC70 (HSPA8 or HSP73), HSPA9 (GRP75, Mortalin-2 or IV1THSP70) and E.coli DnaK (ecDnaK), £ coli HscA, Geobacillus kaustophilus HTA426 DnaK 
(gkDnaK) sequences are used for the alignment. Secondary structure elements and residue numbering for HSP70 is indicated above the protein 
sequence. The hinge region (also known as loop U,p) is highlighted in a red rectangular box, the arch residues are indicated with red asterisks, 
residues involved in NRLLLTG LeuSp substrate binding are indicated with green asterisks, and the juncture region residue Asn540 is indicated with a 
blue asterisk. The conserved hydrophobic residues in the helical bundle region are indicated with red triangles. 
doi:1 0.1 371 /journal.pone.01 0351 8.g002 



the C-terminal part of helix aB and the associated heUcal bundle 
region of the SBDa .subdomain to take on more divergent 
positions when compared to different HSP70 proteins. Interest- 



ingly, DnaK shows a more extensive network of interactions 
between residues within helix aB and the SBDP subdomain. This 
is consistent with the greater degree of structural and functional 



PLOS ONE I www.plosone.org 



6 



July 2014 I Volume 9 | Issue 7 | e103518 



Structure of NRLLLTG-Bound HSP70 




Figure 3. Comparison of tKie substrate-bound HSP70 SBD to other DnaK and HSC70 SBD structures. A. Comparison of the NRLLLTG- 
bound HSP70 SBD to the NRLLLTG-bound DnaK SBD (PDB code 1DKZ). HSP70 and DnaK are shown in magenta and blue cartoons, respectively. The 
corresponding peptide substrates are shown as stick models. B. A close-up view of the hydrophobic interactions within the helical bundle region of 
the oc subdomain. Hydrophobic residues are shown as stick models. C. Comparison of the SBDa subdomain of the human HSP70-NRLLLTG structure 
with the isolated three-helix bundle region of the same protein in X-ray structure (PDB code 3L0F, blue) and NIVIR solution structure (PDB code 2LMG, 
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green). D, Structural alignment of the SBDot subdomain of the HSP70-NRLLLTG structure with the three-helix bundle region of C. elegans Hsp70 (PDB 
code 2P32, cyan) and DnaK (PDB code 1DKZ, blue). E. Superposition of the lid subdomains of human HSP70 and rat HSC70. HSP70 (MolA) and HSC70 
are colored in magenta and cyan, respectively. F. A close-up view of the interaction between the SBDot and SBDp subdomains of HSP70. Interacting 
residues are highlighted as sticks, and hydrogen bonds are indicated with dotted lines. 
doi:1 0.1 371/journal.pone.01 0351 8.g003 



plasticity of the eukaryotic HSP70 proteins relative to the DnaK 
proteins. We hypothesize that the less constrained SBDct 
subdomain of the HSP70 proteins may influence the interaction 
of HSP70 proteins with different co-chaperones such as CHIP 
[72]. 

Substrate Recognition by HSP70 

The NRLLLTG substrate peptide is known to bind to the ADP- 
bound HSP70/DnaK with high affinity [33,73]. However, upon 
exchange of ADP with ATP, the HSP70/DnaK protein undergoes 
a conformational change, switching to the lower-affinity peptide 
binding state [38,73,74]. Similar to the NRLLLTG-DnaK-SBD 
structure, the NRLLLTG peptide substrate binds to the SBDP 
subdomain of the human HSP70 SBD (Fig. 4A). This indicates 
that this structure is representative of the peptide bound to the 
ADP form of full length HSP70. The SBDf5 subdomain is made up 
of two layers of fi sheets that interact through hydrophobic 
interactions (Figs. lA and 4A). The first layer consists of strands 




V476 



fi3, P6, [57 and [58, and the second layer contains strands [55, [54 
and [51, [52 (Fig. IB). These [5-strands are connected by loops Li 2 
through Lg 7. The NRLLLTG peptide binding pocket is situated 
between loops L3 4 and Li 2, and the binding cleft is further 
stabiUzed by loops L4 5 and L5 5 through a series of hydrogen 
bonds and multiple van der Waals contacts (Figs. lA, 4A and 4B). 

The substrate-binding groove of HSP70 shows significant 
conservation with the corresponding groove of DnaK (and other 
HSP70 proteins). Highly conserved residues Leu403 (Ile401 in 
DnaK), Phe428, Val438, Ile440, Ile474 and Val476 at the base of 
the hydrophobic pocket each make van der Waals interactions 
with the central hydrophobic residue of the peptide substrate, 
Leu5p (Fig. 4C). These interactions explain the strong preference 
for a hydrophobic residue, and particularly leucine, in this position 
of HSP70 substrates. The flanking peptide residues make more 
minor interactions with HSP70, while the side chains of Ala406 
and Tyr43 1 form a structurally conserved arch over the surface of 
the pocket and make van der Waals interactions with flanking 




B 



D 



Figure 4. NRLLLTG peptide substrate binding by HSP70. A. Omit electron density of the peptide substrate bound to the p subdomain of 
molecule A is contoured at 3.0 sigma. The NRLLLTG peptide is shown as a stick model in CPK coloring. B. Peptide binding site highlighting the 
flanking L, 2 and L3 4 and supporting Lj ^ and L4 5 loops. The NRLLLTG peptide and arch residues are shown as stick models. C. Detailed view of the 
NRLLLTG-binding pocket in HSP70-NRLLLTG highlighting a network of van der Waals contacts with LeuSp of the NRLLLTG peptide. D. A close-up view 
of the interactions between Leu4p-Leu5p-Thr6p of the NRLLLTG peptide and the surrounding residues in HSP70-NRLLLTG. All interacting residues are 
shown as sticks and hydrogen bonds are shown as dotted lines. 
doi:1 0.1 371/journal.pone.01 0351 8.g004 
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Figure 5. Putative druggable sites in the HSP70 SBD domain. An overview of three hydrophobic pocl<ets within the SBD domain have been 
circled and marl<ed as A, B and C. Close-up of the surface area of these three sites are illustrated in CPK coloring mode, with carbon in green, oxygen 
in red and nitrogen in blue. Secondary structural elements within the area are labeled. The NRLLLTG substrate is removed from this figure (pocket A) 
to better illustrate the pocket A for comparison with the other pockets (B and C). 
doi:1 0.1 371 /journal.pone.01 0351 8.g005 



residues Leu4p and Thr6p (methyl group) (Fig. 4D). Positions 
corresponding to A]a406 and Tyr431 are not conserved within 
DnaK but are typically hydrophobic in DnaK and other HSP70 
proteins (Fig. 2). These arch residues correspond to residues 
Met404 and Ala429 in E. coli DnaK, Met and Phe in E. coli HscA 
and Ser and Ala in E. coli HscC. Taken together, the HSP70 
substrate-binding site appears to be tailored for a hydrophobic 
residue and leucine in particular that is flanked by 2 additional 
residues with hydrophobic character, and this is conserved in 
DnaK and other HSP70 proteins. 

A survey of DnaK SBD X-ray structures with the same peptide 
sequence (i.e. NRLLLTG) reveals that most substrate-bound 
structures contain Leu4p in the hydrophobic pocket [33,42]. 
However, one structure (PDB code 4EZW) also has Leu5p 
modeled into the pocket; and another structure contains the 
peptide NRLILTG instead of the canonical NRLLLTG peptide, 
but bound in the reverse orientation with Leu3p in the 



hydrophobic pocket (PDB code 4EZY) [42]. Analysis of the 
crystal packing arrangements of several DnaK SBD /peptide 
structures does reveal that the crystal-packing contacts could, and 
likely do, influence the register of peptide binding [33,42]. 
Consistent with their broad substrate specificity, we hypothesize 
that the HSP70 isoforms can accommodate various peptide 
registers, and the actual binding mode may depend on the 
composition of the substrate and the microenvironment. 

Implications for Developing HSP70 Inhibitors 

Rodina et al. (2013) recently unveiled 5 potential druggable 
clefts from a theoretical model of full length human HSP70, which 
was generated from the DnaK SBD. Combining the S-score, D- 
score and size of the cavity, they identified 2 sites (sites 1 and 2) in 
the NBD that are considered "most druggable" (Rodina et al., 
2013). Furthermore, they tested a novel inhibitor called YK5 and 
provided evidence, in addition to computer modeling, that it binds 
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to site 1 [30]. Intriguingly, Schlecht et al. recently cocrystaJlized 
the N-terminal NBD domain of HSP70 (residues 1-382) with the 
HSP70 inhibitor VER- 155008 (PDB code 4I08) [59], and in this 
structure the inhibitor binds to site 2 in the NBD. 

The high resolution NRLLLTG-bound HSP70-SBD structure 
reported here provides an opportunity to identify additional 
potential druggable sites within the HSP70 SBD domain. A 
detailed analysis of the structure reveals three potentially 
druggable hydrophobic pockets for the rational development of 
inhibitors (Fig. 5), two of which were previously noted using the 
theoretical model of the human SBD (Rodina et al., 2013). The 
first site (labeled A in Fig. 5) maps to the canonical substrate- 
binding site, referred to here as the Leu5p binding pocket. The 
second site (labeled B in Fig. 5) is distinct from the NRLLLTG- 
binding cleft and is supported by loops L^, p, L2;j, Lg 7 and Ll 1; a 
similar hydrophobic binding site has been reported by Cellitti et al. 
(2009) [68] . A third site (labeled C in Fig. 5) is located in the helix 
bundle region, which is more covert and has been investigated 
recently [75]. Given the importance of these predicted hydropho- 
bic pockets in orchestrating the crosstalk among the NBD, SBDP 
and SBDa domains, and the interaction between the SBD domain 
and other co-chaperones, it is anticipated that small molecule 
inhibitors or inhibitory peptides that preferentially interact with 
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